Scaling properties and universality of first-passage time probabilities in financial 

markets 
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Financial markets provide an ideal frame for the study of crossing or first-passage time events of 
non-Gaussian correlated dynamics mainly because large data sets are available. Tick-by-tick data of 
six futures markets are herein considered resulting in fat tailed first-passage time probabilities. The 
scaling of the return with the standard deviation collapses the probabilities of all markets examined, 
and also for different time horizons, into single curves, suggesting that first-passage statistics is 
market independent (at least for high-frequency data). On the other hand, a very closely related 
quantity, the survival probability, shows, away from the center and tails of the distribution, a 
hyperbolic t~^'^ decay typical of a Markovian dynamics albeit the existence of memory in markets. 
Modifications of the WeibuU and Student distributions are good candidates for the phenomenological 
description of first-passage time properties under certain regimes. The scaling strategies shown may 
be useful for risk control and algorithmic trading. 

PACS numbers: 89.65.Gh, 02.50.Ey, 05.40.Jc, 05.45.Tp 



I. INTRODUCTION 

The first-passage time (FPT) of a given stochastic pro- 
cess is a random variable representing the instant of time 
when the process first attains some preassigned "criti- 
cal" value. The study of problems related to first-passage 
time has a long tradition in many branches of science and 
engineering |2', 3]. In recent years, empirical as well as 
theoretical analysis on this kind of problems have gained 
a broader interest. Thus new multidisciplinary contexts 
like ionic transport in a bacterial porin [J] , protein fold- 
ing in a crowded cell environment [5|, epidemics spread- 
ing in human diseases or computer viruses and human 
mobility Q require a substantial analysis of FPT events. 
In many cases the customary assumption of an underly- 
ing uncorrelated Gaussian diffusion dynamics is no longer 
applicable [3, Q . This is certainly the case of financial 
markets because market fluctuations are away from be- 
ing Gaussian owing to fat-tails and the appearance of 
clustering structures with time [9|, Il0| . 

Unfortunately, the observation of first-passage time 
phenomena requires large databases in order to get re- 
liable statistics of the most extreme and, hence, rare 
events. However, high-frequency financial time series can 
be truly large usually containing millions of observations. 
This allows a rather complete estimation of FPT proba- 
bility distributions and their tail decay (the latter ruling 
extreme events) going thus beyond common approaches 
based only on the evaluation of first moments, i.e., the 
mean first-passage time and the mean exit time [11| . 

An additional focus of interest is linked to the financial 
trading industry. The most extreme and less probable sit- 
uations become crucial to calibrate default probabilities 
and hence obtaining alternative or improved credit risk 
estimations |12|. Less dramatic situations involving the 



over, intraday algorithmic trading strategies [15| of buy- 
ing or selling a given asset could learn from FPT statis- 
tics. Computer based trading is thought to be respon- 
sible for as much as 73% of trading volume in the US 
in 2009 [16]. Yet empirical FPT probabilities are quite 
unexplored in financial markets [ITI, HH although there 
are some works studying other extreme-time statistics 



pricing of American and other exotic options also need to 
measure statistics related to FPT events [Ij, |lj| . More- 



The two main goals of this paper are: (i) the esti- 
mation of FPT probabilities for a wide class of mar- 
kets and (ii) their subsequent interpretation and possi- 
ble classification as universal properties of the market 
as a whole. We will thus look at financial databases 
in a high-frequency resolution and thoroughly study 
the FPT statistics by taking transaction to trans- 
action data (not mid-price) of six futures contracts. 
These are: DeutscheMark-US Dollar foreign currency 
(USDM, 04/01/1993-09/12/1997), Standard & Poor's 
500 index (SP500, 04/01/1993-09/12/1997), Deutscher 
Aktiken index (DAX, 2007/02/13-2007/06/13), Dow 
Jones Industrial Average Index (DJI, 2006/03/01- 
2007/08/27), Euro-US Doflar foreign currency (EU- 
RUSD, 2007/08/01-2007/08/27), the Spanish index 
(IBEX, 2007/01/02-2009/12/30). Each database con- 
tains millions of transactions. For example, the IBEX 
data contains 4,613,250 non-simultaneous transactions. 
We are taking the nearest expiry future contract and 
look at intraday price statistics. We are excluding first 
and last 30 minutes of each trading day in order to avoid 
opening and closing anomalous efi^ects. We are also ex- 
cluding big jumps due to the roll on to the next nearest 
expiry future contract. 



II. ANALYSIS 

We will analyze the return defined as the logarithmic 
price change at time t: 

X{t)^\n[S{t)/So], (1) 

where Sq = S{0) is the price at the starting time t = 0. 



Scaling across different markets and time 
horizons 



As stated above, our main objective is estimating the 
FPT probability distribution, W{x,t), which is defined 
as the probability that the first-passage time to a target 
level X is less than t. In other words, W{x,t) is the 
probability that the return first crossed level x before 
instant t: 

^ ' ' yProh[X{t') <x,0<t' <t] if a; < 0. ^ ' 

Let us first focus on the behavior of the FPT probabil- 
ity across the different markets selected. Figure [1] shows 
W{x,t) for different markets at a time horizon < = 15 
min. Very similar results are observed for shorter and 
longer times. The discounting in futures contracts with 
respect the underlying spot price or the long-term trend- 
ing behaviour of futures could make look different the 
positive {x > 0) and the negative {x < 0) wings of the 
FPT probabilities but these effects are not strong enough 
observed in our results since we mostly focus on intraday 
time horizons. Great differences between positive target 
values and negative target values become noticeable only 
for much longer time horizons |35| . 

If we scale the target returns x of each market with its 
own standard deviation: 
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FIG. 1: (Color online) Log-log representation of FPT proba- 
bilities, W{x,t), for different assets as a function of return x 
(top) and as a function of the scaled return x/vt where vt is 
defined in Eq. Q (down) when f = 15 min. We label positive 
target values, a; > 0, with {+) and negative ones, a; < 0, with 
(-). Solid line corresponds to the Wiener case ^. The inset 
in the down figure shows the difference W{x,t) — WG{x,t) 
with the FPT probability of the Wiener process WG{x,t). 



V, = ViXit)^) {X{t))\ 



(3) 



then Fig. [T] shows a fairly neat collapse into a single FPT 
probability curve after plotting W{x^t) in terms of the 
scaled returns x/vt- We observe that this is so despite 
the highly non-Gaussian character of data. We remind 
at this point that the Wiener model 



dXait) = adW{t), 



(4) 



where dW{t) is a Gaussian stochastic process with zero- 
mean and variance given by di, has the following FPT 
probability [2||: 



WG{x,t) = erfc 



(x/^/2^' 



(5) 



Notice that scaling enhances the distinction between 
the empirical results and the corresponding Wiener 
model with unit variance (see the solid line in the right 



frame of Fig. [T|). However, collapse still holds for tar- 
get levels as high as 10 times the size of the (empirical) 
15-minutes standard deviation. The same behavior ap- 
pears for shorter and longer time horizons (from 1 to 120 
minutes) as shown in Fig. [21 

The inset in Fig. [1] shows the difference W{x, t) — 
Wcix^t) for each market as a function of the scaled re- 
turn x/vt- As we can see the Wiener model systemat- 
ically underestimates the empirical FPT probability for 
large target values (a; > wj) but somewhat overestimates 
W{x,t) for smaller levels {x < O.lwt). This remarkable 
result had been theoretically predicted by two of us us- 
ing more sophisticated models such as the CIR-Heston 
stochastic volatility model [l7|. We believe that this 
fact r nay have non-trivial consequences in risk manage- 
ment [il[i2. 

Our next step is to select a particular market and ob- 
serve how the FPT probability changes with time. Fig- 




FIG. 2: (Color online) Log-log representation of the FPT probabilities as a function of the scaled return and with different 
times. Solid line represents the Wiener case given by Eq. ^. Insets show the difference W(x,t) — WG{x,t). 




FIG. 3: (Color online) Log-log representation of the survival probabilities as a function of an scaled time r — {vo/x)^t and for 
several target values x measured in units of the 30- minutes standard deviation vq. The insets show the same results with time 
measured in seconds and in a semi-log scale. Solid line represents the Wiener case when x = vq. 



lire [H shows market data for different values of t, from 1 
to 120 minutes. For large time horizons (up to 2 hours), 
the insets of Fig. [5] show that when x < vt the under- 
estimation of the Wiener model tends to disappear. In 
contrast, when x > Vt the overestimation persists and 
becomes more pronounced as time increases. It is also 
worth noticing that low resolution in price changes (as 
is the case of USDM) is also undervaluing W{x,t) when 



x/vt < 1. 

Let us mention that the relative collapse shown in 
Fig. [2] indicates that, for high frequency data, the tail of 
the crossing probability W{x, t) is, when properly scaled, 
independent of the time horizon. Moreover, the fact that 
this collapse is shared by all markets herein considered 
strongly suggests that W{x, t) is market independent as 
shown in Fig. [T]for the time horizon of 15 min. 



B. Scaling across different target levels 

Another interesting and alternative way of observing 
extreme-time events is to look for the the survival prob- 
ability (SP), that is, for the probability S{x,t) of not 
having reached level x before time t: 



S{x,t) 



f Prob[X(i') < x, < t' < <] if X > 
\Pioh[X{t') >x, Q<t' <t] if a; < 0. 



(6) 



Note that the SP is related to the FPT probability by 
the simple relation S{x,t) = 1 — W{x,t). Figure |2] shows 
how the SP decays with time for several target levels 
measured in 30- minute standard deviation units vo- As 
shown in Fig. [31 a collapse is still possible as long as we 
use the dimcnsionless time 
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(+) (-) 


(+) (-) 


EURUSD 0.0012 0.0012 


0.1150 0.0858 


USDM 


0.0023 0.0020 


0.1299 0.1772 


SP500 


0.0019 0.0020 


0.1181 0.1778 


DJI 


0.0018 0.0018 


0.1042 0.1829 


IBEX 


0.0011 0.0014 


0.1831 0.1396 


DAX 


0.0007 0.0010 


0.0871 0.1542 



TABLE I: Dispersion of collapsed FPT and SP curves shown 
in Figs. [2] and O using the form described by Eq. (|10|l . 



with a previous work of two us in the context of the Mean 
Exit Time ,21,1. 



(t-o/a;)2t. 



(7) 



The collapse holds for not very small levels. The smallest 
level, X = O.Olwo, is affected by the presence of finite tick 
size and by the price autocorrelation 0, [13] ■ Addition- 
ally, the lack of data affects the largest levels or the data 
markets with a shorter period of time. See the case of 
EURUSD data which is due to holding a too short time 
period (26 days). 

The SP decays as t"^/^ except for the EURUSD case. 
This hyperbolic behavior with time also holds for the 
Wiener model (see Eq. ([5|)) although the latter system- 
atically and drastically underestimates large time hori- 
zons. In fact, the hyperbolic decay behavior is the 
one predicted by the Sparre-Andersen theorem which 
holds for any sym metric Markov process including Levy 
flights 0, I32H34I . The theorem poses now the question 
about how important are correlations in our data. Trans- 
action to transaction has been then shuffled in two dif- 
ferent ways. A first choice randomizes the ordering of 
return changes while keeping the inter-transaction times 
as they appear in original data. A second choice consists 
in randomizing the ordering of inter-transaction times 
while keeping the original structure consecutive price re- 
turn changes. 

Figure S] shows the resulting FPT distributions after 
applying the proposed shuffles in returns and in the inter- 
transaction times. It shows that these two different ma- 
nipulations lead to two different effects. The random- 
ization in price returns ordering clearly affects the FPT 
statistics when varying threshold distance. In this case, 
the tail of the W {x, t) distribution drastically diminishes 
to almost the exponential form while the S {x, t) curve 
keeps the same hyperbolic decay when varying time hori- 
zon. On the other hand, shuffling the inter-transaction 
times have little effect to most of the markets and keep 
the extreme statistics curves almost identical. These re- 
sults are similar in the other time horizons and other 
target returns studied above. The conclusion that price 
returns ordering are more relevant than inter-transaction 
time ordering in the extreme times analysis is consistent 



C. Measuring the dispersion of each collapse 

We will now give a quantitative measure for the quality 
of each collapse by means of a dispersion relation among 
the different curves. Taking FPT data of a given mar- 
ket at different time horizons (see Fig. [5]) we define the 
following dispersion measure: 
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1 *^ 
— ^[M^(x„t,)-M.j', (8) 



i=i 



where M = 7 is the number of different time horizons 
considered, ^xt is the quantity 



1 ^' 
-J2wix.,t,), 



(9) 



j=i 



Axi is the bin width, N is number of bins and L — NAxi 
is the largest target value x considered. 

We can also perform a similar evaluation using the 
SP data shown in Fig. [3] with a dispersion Qt defined 
accordingly: 



1 *^ 



H '\ 



1 ^ 
-^[VF(x.,i,)-M*J" 



(10) 



where iV = 7 is the number of different target values 
considered, ^t - is the quantity 



1 ^'' 



(11) 



Atj is the bin width, M is number of bins and T — 
MAtj is the largest time horizon t considered. The two 
measures of dispersion are summarized in Table ID 

We can also evaluate the dispersion for the collapse 
of the several markets shown in Fig. [T] (right). In this 
case, and using an equivalent dispersion measure as that 
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FIG. 4: (Color online) Log-log representation of the FPT probabilities after shuffling price increments (green dashed line) and 
interevent times (blue dotted line) in comparison with original curve (red solid line) and Wiener case given by Eq. ^ (black 
solid line). For the sake of clarity we take only the 900 seconds time window case when varying x target and 0.5i)o case when 
varying t in the inset. 



of Eq. (UnD, we have 9+ 



0™.* = 7.99 X 10-4 



mkt 



a; < 0). We see that 



7.12 X 10 '^ (x > 0) and root term) Student distribution: 

Wstu{x,t)^ (l + x/V^) ". (14) 



(12) 



which clearly proves that the best collapse is between dif- 
ferent markets and the worst is between time horizons of 
the SP, as otherwise expected from the visual perception 
of Figs. \M\ 



D. Phenomenological expressions 

We finish by presenting two phenomenological expres- 
sions for the FPT probability aimed to reproduce em- 
pirical observations. A first mathematical expression is 
provided by the Weibull distribution which has been typ- 
ically a good candidate for adjusting extreme events in 
a large variety of data sources [13, UM ■ ^^ propose the 
following modified distribution 



W^wci(x,t)=e-(^/^)' 



(13) 



where the modification to the WeibuU's standard form 
consists in the ad-hoc addition of a square root term 
which gives the time dependence of the distribution; a 
term added to reproduce the observed decay in empirical 
data (see Fig. [3]). In Fig. [5] we check this adjustment to- 
gether with the following modified (again with a square 



The IBEX and DJI results for positive returns are 
shown in Fig. [5l We then use the same parameters val- 
ues to plot both the FPT and the SP curves (inset) being 
fairly consistent. Table |ll] shows the fitting parameters 
for all data sets considered. Note that all Weibull ex- 
ponents are close to /3 = 1 (the mean exponent of all 
markets is /3 = 1.04) which corresponds to the Laplace 
distribution. Also all power-law exponents of the Stu- 
dent distribution are rather similar: a little above a = 3 
(the mean exponent is a = 3.32) which is very close to 
the decay exponent for the market unrestricted proba- 
bility (not the FPT probability) proposed by Plerou and 
Stanley [22,]. 

Finally, Fig. \5\ clearly shows, as otherwise expected, 
that the Student distribution better describes the tails of 
the empirical curves than it does the Weibull distribution 
while the situation is reversed for smaller returns, where 
Weibull performs better than Student, the crossover be- 
ing approximately at |a;| ~ 5vt. Therefore, models pro- 
posed in future to explain FPT statistics should follow 
this very schematic for small (Weibull) and large (Stu- 
dent) target levels. As to the unrestricted return dis- 
tribution case, it has not been possible to fit a single 
distribution for the whole curve. 
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FIG. 5: (Color online) FTP probability and SP fits of DJI 
and IBEX using Weibull ITS]) and Student (fTi)) functions in 
terms of scaled return and scaled time (insets). 



Wvci{x,t) 



W,tuix,t) 



p 



b X lQ-^s~ 



EURUSD 0.93 ±0.01 2.87 ±0.18 

USDM 0.98 ±0.01 1.18 ±0.05 

SP500 1.01 ±0.02 1.31 ±0.05 

DJI 1.03 ±0.01 1.10 ±0.05 

IBEX 1.10 ±0.01 1.08 ±0.01 

DAX 1.21 ±0.01 1.05 ±0.02 



3.49 ± 0.03 
3.28 ±0.01 
3.49 ±0.01 
3.21 ±0.02 
3.15 ±0.02 
3.32 ±0.02 



24.9 ±1.1 
9.4 ±0.6 
12.2 ±0.9 
8.7 ±0.8 
8.4 ±0.6 
9.7 ±0.7 



TABLE II: Parameters of the Weibull ^ and Student HH) 
distributions simultaneously fitting FPT probability and SP 
for a; > 0. 



III. CONCLUSIONS 

To conclude, even though it is a rather unexplored 
context, financial markets provide an ideal frame for the 
study of first crossing events of non-Gaussian correlated 
dynamics essentially because large data sets are available. 
We have worked on transaction-to-transaction data of six 
different futures markets and have studied the form of the 



empirical FPT probabilities. We have obtained different 
scaling strategies that allow us to collapse these otherwise 
scattered probabilities into a single curve. 

As a function of the target return x, the FPT prob- 
ability W{x, t) depends not only on the time horizon t 
but also on the market selected. This triple dependence 
yields different scaling strategies. The first and most 
effective scaling corresponds to the FPT collapse, at a 
given time window and varying threshold distance, across 
the six different markets surveyed (Fig. [T]). This is done 
by normalizing target returns with their own volatility, 
calculated as mentioned before. It is then possible to 
see that, as likewise expected, Gaussian diffusion models 
underestimate the probabilities of large returns. How- 
ever, and quite unexpectedly, they overestimate empiri- 
cal FPT probabilities for small target returns. We believe 
that this should have substantial practical implications in 
risk management and control. 

A second scaling attempt is made by selecting a sin- 
gle market and looking at the dependence of W{x, t) on 
the target return for different time horizons. A fairly 
good collapse of probabilities is observed when returns 
are again scaled by the standard deviation correspond- 
ing to each time horizon. This second scaling allow us to 
observe how the previously mentioned overvaluation and 
undervaluation changes with time (see inset of Fig. [3]) 
but has essentially the same behavior in the tail. 

The third and last scaling strategy is addressed to the 
survival probability. In this case we set the target re- 
turn X at some value and observe the time evolution of 
the SP. Collapsing curves are then obtained by a time 
scaling provided by diffusion theory (cf. Eq. ([7])) even 
though diffusion behavior is not sustained by empirical 
data. Looking at Fig. [3] we see that the SP collapse 
works properly, showing the diffusive hyperbolic decay 
t^^", only if target returns are neither very small nor 
very large compared to their standard deviation. A little 
thought shows that this is the expected behavior since, 
as is well known, markets are approximately Gaussian 
away from the tails [lO|, |30| and the center of the distri- 
bution [13, im. 

The Sparre Andersen theorem [33, [33] claims that 
Markovian processes including the Levy flights have a SP 
with the hyperbolic decay t~^''^ . We have performed the 
exercise of shuffling our transaction-to-transaction data 
in two different ways with the aim of breaking memory in 
our market data. Figure S] shows that shuffling in price 
changes ordering have a much more important effect than 
inter transaction time ordering. The return suffling dras- 
tically diminishes the tail decay of the FPT statistics (cf. 
Fig. [4]). The hyperbolic decay i^^/^ of the SP is preserved 
after the two shuffling methods. 

Based on the scahng strategies obtained, we study 
modified versions of the Weibull and Student distribu- 
tions for a phenomenological description of the empirical 
observations. We have seen that for large returns FPT 
probabilities are better adjusted by a Student distribu- 
tion, while probabilities corresponding to intermediate 



and small returns are better described by a Weibull (al- 
most Laplacian) distribution. This should come as no 
surprise, since the Student distribution possesses fat tails 
while that of Weibull does not. Moreover, for the markets 
herein considered, tail exponents of the Student distribu- 
tion are tightly packed around their mean value a — 3.32, 
which is very near to the tail exponent a — S obtained by 
Plerou and Stanley [20| for the unrestricted probability 
of several unrelated markets. This seems to indicate a 
kind of universal behavior of markets not distinguishing 
among restricted probabilities (i.e., FPT distributions) 
and unrestricted ones. 

Let us finally remark two additional universality as- 
pects that may be shared by all financial markets. First, 
note that under the assumption that data is adjusted 
by a Weibull or a Student distribution, or by a mix- 
ture of them, our scaling strategies suggest the possibility 
of obtaining the whole first-passage time probability by 



only estimating the volatility with no need of any other 
additional data. Whether or not this can be extended 
to any market should be extensively checked on empir- 
ical data and this is beyond the bounds of the present 
work. Second, and following this somewhat speculative 
reasoning, we also note that the neat collapse of the FPT 
probabilities for different markets (see Fig. [1]) is a sub- 
stantial indication that extreme events statistics are, in 
high-frequency data, market independent. This guess, 
which sustains a truly universal property, should also be 
checked on wider markets and wider periods of time. 
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